Overview
Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 3172 |
| Missing cells | 797 |
| Missing cells (%) | 2.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 322.2 KiB |
| Average record size in memory | 104.0 B |
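The percentage of missing cells and the average record size can be recomputed from the other figures in this table. The short Python check below uses only numbers copied from the table; the variable names are illustrative.

```python
# Figures copied from the dataset statistics table.
n_variables = 12
n_observations = 3172
missing_cells = 797
total_size_kib = 322.2

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size in bytes (1 KiB = 1024 bytes).
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"{total_cells} cells, {missing_pct:.1f}% missing, {avg_record_bytes:.1f} B per record")
```

All 797 missing cells come from ENERGYSTARScore, so the same count is 25.1% of that column but only about 2.1% of the whole table.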
Variable types
| Categorical | 3 |
|---|---|
| Numeric | 9 |
Alerts
| Alert | Type |
|---|---|
| PropertyGFABuilding(s) is highly overall correlated with SiteEnergyUse(kBtu) | High correlation |
| SiteEnergyUse(kBtu) is highly overall correlated with PropertyGFABuilding(s) | High correlation |
| Use_Steam is highly imbalanced (77.4%) | Imbalance |
| ENERGYSTARScore has 797 (25.1%) missing values | Missing |
| SiteEnergyUse(kBtu) has unique values | Unique |
| NumberofBuildings has 90 (2.8%) zeros | Zeros |
| PropertyGFAParking has 2694 (84.9%) zeros | Zeros |
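The 77.4% imbalance reported for Use_Steam can be reproduced from its class counts (3056 zeros, 116 ones). The sketch below assumes the score is one minus the Shannon entropy normalized by its maximum; that formula is an assumption that happens to match the reported figure, and `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for a list of class counts.

    Assumes at least two classes; 0 means perfectly balanced, 1 means one class only.
    """
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    max_entropy = math.log2(len(counts))
    return 1 - entropy / max_entropy

# Use_Steam class counts copied from the report.
score = imbalance_score([3056, 116])
print(f"Use_Steam imbalance: {score:.1%}")
```

Under the same assumption, a perfectly balanced binary column scores 0 and a column with a single observed class scores 1.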
Reproduction
| Analysis started | 2025-12-15 14:14:24.450394 |
|---|---|
| Analysis finished | 2025-12-15 14:14:34.316432 |
| Duration | 9.87 seconds |
| Software version | ydata-profiling v4.18.0 |
| Download configuration | config.json |
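A report of this kind can be regenerated along the following lines. This is a minimal sketch, not the configuration actually used: the input path, output path, and title are hypothetical, and the settings stored in config.json are not reproduced here.

```python
def build_profile(csv_path, html_path="report.html"):
    """Profile a CSV file and write an HTML report (paths are hypothetical)."""
    # Imported lazily so this module still loads where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    report = ProfileReport(df, title="Overview")
    report.to_file(html_path)
    return report
```

Calling `build_profile("buildings.csv")` (a hypothetical file name) would write `report.html` next to the script.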
Variables
PrimaryPropertyType
Categorical
| Distinct | 23 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 49.6 KiB |
[Bar chart: category frequencies for Low-Rise Multifamily, Mid-Rise Multifamily, Small- and Mid-Sized Office, Other, Warehouse, and other values (18)]
Length
| Max length | 27 |
|---|---|
| Median length | 22 |
| Mean length | 17.430013 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Hotel |
|---|---|
| 2nd row | Hotel |
| 3rd row | Hotel |
| 4th row | Hotel |
| 5th row | Other |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Low-Rise Multifamily | 952 | 30.0% |
| Mid-Rise Multifamily | 553 | 17.4% |
| Small- and Mid-Sized Office | 286 | 9.0% |
| Other | 244 | 7.7% |
| Warehouse | 184 | 5.8% |
| Large Office | 156 | 4.9% |
| Mixed Use Property | 128 | 4.0% |
| High-Rise Multifamily | 103 | 3.2% |
| Retail Store | 85 | 2.7% |
| Hotel | 74 | 2.3% |
| Other values (13) | 407 | 12.8% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| multifamily | 1608 | 24.3% |
| low-rise | 952 | 14.4% |
| mid-rise | 553 | 8.4% |
| office | 480 | 7.3% |
| and | 286 | 4.3% |
| small | 286 | 4.3% |
| mid-sized | 286 | 4.3% |
| other | 244 | 3.7% |
| warehouse | 196 | 3.0% |
| large | 156 | 2.4% |
| Other values (28) | 1569 | 23.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| i | 7379 | 13.3% |
| e | 4382 | 7.9% |
| l | 4211 | 7.6% |
| (space) | 3444 | 6.2% |
| a | 2948 | 5.3% |
| t | 2706 | 4.9% |
| M | 2613 | 4.7% |
| f | 2608 | 4.7% |
| - | 2258 | 4.1% |
| s | 2117 | 3.8% |
| Other values (33) | 20622 | 37.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 55288 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 55288 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 55288 |
Latitude
Real number (ℝ)
| Distinct | 2719 |
|---|---|
| Distinct (%) | 85.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.624785 |
| Minimum | 47.50224 |
|---|---|
| Maximum | 47.73387 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 49.6 KiB |
Quantile statistics
| Minimum | 47.50224 |
|---|---|
| 5-th percentile | 47.54393 |
| Q1 | 47.600962 |
| median | 47.61918 |
| Q3 | 47.657157 |
| 95-th percentile | 47.713065 |
| Maximum | 47.73387 |
| Range | 0.23163 |
| Interquartile range (IQR) | 0.056195 |
Descriptive statistics
| Standard deviation | 0.047117487 |
|---|---|
| Coefficient of variation (CV) | 0.00098934803 |
| Kurtosis | -0.10822197 |
| Mean | 47.624785 |
| Median Absolute Deviation (MAD) | 0.02647 |
| Skewness | 0.1601006 |
| Sum | 151065.82 |
| Variance | 0.0022200576 |
| Monotonicity | Not monotonic |
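Several of the descriptive statistics are simple functions of others in the same tables. As a worked check, the Python snippet below recomputes the Latitude range, IQR, coefficient of variation, and variance from the reported quartiles, extremes, mean, and standard deviation; the inputs are rounded report values, so agreement is only to the displayed precision.

```python
# Values copied from the Latitude tables in this report.
q1, q3 = 47.600962, 47.657157
minimum, maximum = 47.50224, 47.73387
mean, std = 47.624785, 0.047117487

iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # spread between the extremes
cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation

print(f"IQR={iqr:.6f}  range={value_range:.5f}  CV={cv:.3e}  variance={variance:.3e}")
```

The same relations hold for the other numeric variables. For Longitude the coefficient of variation is negative only because the mean is negative.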
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 47.66246 | 9 | 0.3% |
| 47.61598 | 7 | 0.2% |
| 47.62208 | 6 | 0.2% |
| 47.52549 | 5 | 0.2% |
| 47.61543 | 5 | 0.2% |
| 47.62395 | 5 | 0.2% |
| 47.60071 | 4 | 0.1% |
| 47.52254 | 4 | 0.1% |
| 47.6239 | 4 | 0.1% |
| 47.59938 | 4 | 0.1% |
| Other values (2709) | 3119 | 98.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 47.50224 | 1 | < 0.1% |
| 47.50959 | 1 | < 0.1% |
| 47.51018 | 1 | < 0.1% |
| 47.51042 | 1 | < 0.1% |
| 47.51098 | 1 | < 0.1% |
| 47.51104 | 1 | < 0.1% |
| 47.51127 | 2 | 0.1% |
| 47.51168 | 1 | < 0.1% |
| 47.51169 | 1 | < 0.1% |
| 47.51304 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 47.73387 | 1 | < 0.1% |
| 47.73375 | 1 | < 0.1% |
| 47.73368 | 1 | < 0.1% |
| 47.7336 | 1 | < 0.1% |
| 47.73357 | 1 | < 0.1% |
| 47.73351 | 1 | < 0.1% |
| 47.73331 | 1 | < 0.1% |
| 47.73316 | 1 | < 0.1% |
| 47.73315 | 1 | < 0.1% |
| 47.73279 | 1 | < 0.1% |
Longitude
Real number (ℝ)
| Distinct | 2511 |
|---|---|
| Distinct (%) | 79.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -122.33521 |
| Minimum | -122.41425 |
|---|---|
| Maximum | -122.26028 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 3172 |
| Negative (%) | 100.0% |
| Memory size | 49.6 KiB |
Quantile statistics
| Minimum | -122.41425 |
|---|---|
| 5-th percentile | -122.38643 |
| Q1 | -122.35076 |
| median | -122.33264 |
| Q3 | -122.32022 |
| 95-th percentile | -122.29181 |
| Maximum | -122.26028 |
| Range | 0.15397 |
| Interquartile range (IQR) | 0.0305325 |
Descriptive statistics
| Standard deviation | 0.026645138 |
|---|---|
| Coefficient of variation (CV) | -0.00021780433 |
| Kurtosis | 0.24891371 |
| Mean | -122.33521 |
| Median Absolute Deviation (MAD) | 0.014895 |
| Skewness | -0.17813393 |
| Sum | -388047.27 |
| Variance | 0.00070996337 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| -122.29898 | 8 | 0.3% |
| -122.35398 | 7 | 0.2% |
| -122.33369 | 6 | 0.2% |
| -122.32468 | 6 | 0.2% |
| -122.31769 | 5 | 0.2% |
| -122.33379 | 5 | 0.2% |
| -122.32417 | 5 | 0.2% |
| -122.33064 | 5 | 0.2% |
| -122.32592 | 5 | 0.2% |
| -122.32811 | 4 | 0.1% |
| Other values (2501) | 3116 | 98.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -122.41425 | 1 | < 0.1% |
| -122.41182 | 1 | < 0.1% |
| -122.41178 | 1 | < 0.1% |
| -122.41169 | 1 | < 0.1% |
| -122.41037 | 1 | < 0.1% |
| -122.41036 | 1 | < 0.1% |
| -122.41031 | 1 | < 0.1% |
| -122.40976 | 1 | < 0.1% |
| -122.40974 | 1 | < 0.1% |
| -122.40901 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -122.26028 | 1 | < 0.1% |
| -122.26034 | 1 | < 0.1% |
| -122.26166 | 2 | 0.1% |
| -122.26172 | 1 | < 0.1% |
| -122.26177 | 1 | < 0.1% |
| -122.2618 | 1 | < 0.1% |
| -122.26216 | 1 | < 0.1% |
| -122.26223 | 1 | < 0.1% |
| -122.26235 | 1 | < 0.1% |
| -122.26277 | 1 | < 0.1% |
YearBuilt
Real number (ℝ)
| Distinct | 113 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1968.6337 |
| Minimum | 1900 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 49.6 KiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 1908 |
| Q1 | 1948 |
| median | 1975 |
| Q3 | 1997 |
| 95-th percentile | 2012 |
| Maximum | 2015 |
| Range | 115 |
| Interquartile range (IQR) | 49 |
Descriptive statistics
| Standard deviation | 33.219065 |
|---|---|
| Coefficient of variation (CV) | 0.016874173 |
| Kurtosis | -0.88331443 |
| Mean | 1968.6337 |
| Median Absolute Deviation (MAD) | 24 |
| Skewness | -0.53754265 |
| Sum | 6244506 |
| Variance | 1103.5063 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2014 | 67 | 2.1% |
| 2000 | 66 | 2.1% |
| 1968 | 62 | 2.0% |
| 2008 | 60 | 1.9% |
| 1988 | 59 | 1.9% |
| 1989 | 58 | 1.8% |
| 1999 | 57 | 1.8% |
| 1970 | 55 | 1.7% |
| 2001 | 54 | 1.7% |
| 2002 | 54 | 1.7% |
| Other values (103) | 2580 | 81.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1900 | 51 | 1.6% |
| 1901 | 7 | 0.2% |
| 1902 | 11 | 0.3% |
| 1903 | 3 | 0.1% |
| 1904 | 14 | 0.4% |
| 1905 | 9 | 0.3% |
| 1906 | 18 | 0.6% |
| 1907 | 31 | 1.0% |
| 1908 | 26 | 0.8% |
| 1909 | 29 | 0.9% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2015 | 35 | 1.1% |
| 2014 | 67 | 2.1% |
| 2013 | 50 | 1.6% |
| 2012 | 35 | 1.1% |
| 2011 | 15 | 0.5% |
| 2010 | 23 | 0.7% |
| 2009 | 39 | 1.2% |
| 2008 | 60 | 1.9% |
| 2007 | 41 | 1.3% |
| 2006 | 44 | 1.4% |
NumberofBuildings
Real number (ℝ)
Zeros
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0655738 |
| Minimum | 0 |
|---|---|
| Maximum | 27 |
| Zeros | 90 |
| Zeros (%) | 2.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 49.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 27 |
| Range | 27 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.89681462 |
|---|---|
| Coefficient of variation (CV) | 0.84162603 |
| Kurtosis | 387.8697 |
| Mean | 1.0655738 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.922782 |
| Sum | 3380 |
| Variance | 0.80427646 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2990 | 94.3% |
| 0 | 90 | 2.8% |
| 2 | 36 | 1.1% |
| 3 | 21 | 0.7% |
| 4 | 12 | 0.4% |
| 5 | 9 | 0.3% |
| 6 | 3 | 0.1% |
| 8 | 3 | 0.1% |
| 10 | 2 | 0.1% |
| 11 | 1 | < 0.1% |
| Other values (5) | 5 | 0.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 90 | 2.8% |
| 1 | 2990 | 94.3% |
| 2 | 36 | 1.1% |
| 3 | 21 | 0.7% |
| 4 | 12 | 0.4% |
| 5 | 9 | 0.3% |
| 6 | 3 | 0.1% |
| 8 | 3 | 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 2 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 27 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 10 | 2 | 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 3 | 0.1% |
| 6 | 3 | 0.1% |
| 5 | 9 | 0.3% |
NumberofFloors
Real number (ℝ)
| Distinct | 43 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.6270492 |
| Minimum | 0 |
|---|---|
| Maximum | 99 |
| Zeros | 14 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 49.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 11.45 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 4.8664213 |
|---|---|
| Coefficient of variation (CV) | 1.0517332 |
| Kurtosis | 62.009403 |
| Mean | 4.6270492 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 5.7213781 |
| Sum | 14677 |
| Variance | 23.682056 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 666 | 21.0% |
| 3 | 648 | 20.4% |
| 1 | 424 | 13.4% |
| 2 | 394 | 12.4% |
| 6 | 294 | 9.3% |
| 5 | 289 | 9.1% |
| 7 | 144 | 4.5% |
| 8 | 59 | 1.9% |
| 11 | 32 | 1.0% |
| 10 | 31 | 1.0% |
| Other values (33) | 191 | 6.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 14 | 0.4% |
| 1 | 424 | 13.4% |
| 2 | 394 | 12.4% |
| 3 | 648 | 20.4% |
| 4 | 666 | 21.0% |
| 5 | 289 | 9.1% |
| 6 | 294 | 9.3% |
| 7 | 144 | 4.5% |
| 8 | 59 | 1.9% |
| 9 | 18 | 0.6% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 99 | 1 | < 0.1% |
| 49 | 1 | < 0.1% |
| 42 | 3 | 0.1% |
| 41 | 2 | 0.1% |
| 40 | 1 | < 0.1% |
| 39 | 1 | < 0.1% |
| 38 | 1 | < 0.1% |
| 37 | 2 | 0.1% |
| 36 | 2 | 0.1% |
| 34 | 1 | < 0.1% |
PropertyGFAParking
Real number (ℝ)
Zeros
| Distinct | 470 |
|---|---|
| Distinct (%) | 14.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7495.4092 |
| Minimum | 0 |
|---|---|
| Maximum | 407795 |
| Zeros | 2694 |
| Zeros (%) | 84.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 49.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 44745.1 |
| Maximum | 407795 |
| Range | 407795 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 28997.958 |
|---|---|
| Coefficient of variation (CV) | 3.8687625 |
| Kurtosis | 47.473691 |
| Mean | 7495.4092 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.0498259 |
| Sum | 23775438 |
| Variance | 8.4088156 × 10⁸ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2694 | 84.9% |
| 13320 | 3 | 0.1% |
| 22000 | 2 | 0.1% |
| 30000 | 2 | 0.1% |
| 10800 | 2 | 0.1% |
| 25800 | 2 | 0.1% |
| 12960 | 2 | 0.1% |
| 100176 | 2 | 0.1% |
| 20416 | 2 | 0.1% |
| 7600 | 1 | < 0.1% |
| Other values (460) | 460 | 14.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2694 | 84.9% |
| 38 | 1 | < 0.1% |
| 260 | 1 | < 0.1% |
| 415 | 1 | < 0.1% |
| 604 | 1 | < 0.1% |
| 756 | 1 | < 0.1% |
| 800 | 1 | < 0.1% |
| 919 | 1 | < 0.1% |
| 1263 | 1 | < 0.1% |
| 1392 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 407795 | 1 | < 0.1% |
| 368980 | 1 | < 0.1% |
| 335109 | 1 | < 0.1% |
| 303707 | 1 | < 0.1% |
| 285688 | 1 | < 0.1% |
| 272900 | 1 | < 0.1% |
| 239252 | 1 | < 0.1% |
| 228668 | 1 | < 0.1% |
| 206597 | 1 | < 0.1% |
| 206580 | 1 | < 0.1% |
PropertyGFABuilding(s)
Real number (ℝ)
High correlation
| Distinct | 3000 |
|---|---|
| Distinct (%) | 94.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 76939.364 |
| Minimum | 3636 |
|---|---|
| Maximum | 1172127 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 49.6 KiB |
Quantile statistics
| Minimum | 3636 |
|---|---|
| 5-th percentile | 21012.15 |
| Q1 | 27485.25 |
| median | 42295.5 |
| Q3 | 82015.25 |
| 95-th percentile | 261089.5 |
| Maximum | 1172127 |
| Range | 1168491 |
| Interquartile range (IQR) | 54530 |
Descriptive statistics
| Standard deviation | 99287.823 |
|---|---|
| Coefficient of variation (CV) | 1.2904685 |
| Kurtosis | 26.47592 |
| Mean | 76939.364 |
| Median Absolute Deviation (MAD) | 18149 |
| Skewness | 4.3020431 |
| Sum | 2.4405166 × 10⁸ |
| Variance | 9.8580717 × 10⁹ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 36000 | 9 | 0.3% |
| 25920 | 8 | 0.3% |
| 28800 | 7 | 0.2% |
| 21600 | 7 | 0.2% |
| 24000 | 6 | 0.2% |
| 30720 | 4 | 0.1% |
| 30240 | 4 | 0.1% |
| 22320 | 4 | 0.1% |
| 45000 | 3 | 0.1% |
| 25200 | 3 | 0.1% |
| Other values (2990) | 3117 | 98.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 3636 | 1 | < 0.1% |
| 10925 | 1 | < 0.1% |
| 11285 | 1 | < 0.1% |
| 11440 | 1 | < 0.1% |
| 11685 | 1 | < 0.1% |
| 11968 | 1 | < 0.1% |
| 12769 | 1 | < 0.1% |
| 12806 | 1 | < 0.1% |
| 13157 | 1 | < 0.1% |
| 14101 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1172127 | 1 | < 0.1% |
| 1047934 | 1 | < 0.1% |
| 1004813 | 1 | < 0.1% |
| 970647 | 1 | < 0.1% |
| 962428 | 1 | < 0.1% |
| 934292 | 1 | < 0.1% |
| 888049 | 1 | < 0.1% |
| 861702 | 1 | < 0.1% |
| 794592 | 1 | < 0.1% |
| 791396 | 1 | < 0.1% |
ENERGYSTARScore
Real number (ℝ)
Missing
| Distinct | 100 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 797 |
| Missing (%) | 25.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 67.231579 |
| Minimum | 1 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 49.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 52 |
| median | 74 |
| Q3 | 89 |
| 95-th percentile | 99 |
| Maximum | 100 |
| Range | 99 |
| Interquartile range (IQR) | 37 |
Descriptive statistics
| Standard deviation | 26.941015 |
|---|---|
| Coefficient of variation (CV) | 0.40071965 |
| Kurtosis | -0.29888023 |
| Mean | 67.231579 |
| Median Absolute Deviation (MAD) | 18 |
| Skewness | -0.81724458 |
| Sum | 159675 |
| Variance | 725.81829 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 87 | 2.7% |
| 98 | 70 | 2.2% |
| 96 | 62 | 2.0% |
| 89 | 52 | 1.6% |
| 93 | 51 | 1.6% |
| 95 | 49 | 1.5% |
| 91 | 47 | 1.5% |
| 99 | 47 | 1.5% |
| 92 | 46 | 1.5% |
| 81 | 46 | 1.5% |
| Other values (90) | 1818 | 57.3% |
| (Missing) | 797 | 25.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 33 | 1.0% |
| 2 | 10 | 0.3% |
| 3 | 13 | 0.4% |
| 4 | 5 | 0.2% |
| 5 | 8 | 0.3% |
| 6 | 8 | 0.3% |
| 7 | 10 | 0.3% |
| 8 | 9 | 0.3% |
| 9 | 5 | 0.2% |
| 10 | 9 | 0.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 87 | 2.7% |
| 99 | 47 | 1.5% |
| 98 | 70 | 2.2% |
| 97 | 43 | 1.4% |
| 96 | 62 | 2.0% |
| 95 | 49 | 1.5% |
| 94 | 44 | 1.4% |
| 93 | 51 | 1.6% |
| 92 | 46 | 1.5% |
| 91 | 47 | 1.5% |
SiteEnergyUse(kBtu)
Real number (ℝ)
High correlation Unique
| Distinct | 3172 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4189992.5 |
| Minimum | 57133.199 |
|---|---|
| Maximum | 98960776 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 49.6 KiB |
Quantile statistics
| Minimum | 57133.199 |
|---|---|
| 5-th percentile | 520911.44 |
| Q1 | 934115.56 |
| median | 1787633.5 |
| Q3 | 4167798.2 |
| 95-th percentile | 16279284 |
| Maximum | 98960776 |
| Range | 98903643 |
| Interquartile range (IQR) | 3233682.6 |
Descriptive statistics
| Standard deviation | 7132304.3 |
|---|---|
| Coefficient of variation (CV) | 1.7022236 |
| Kurtosis | 34.42321 |
| Mean | 4189992.5 |
| Median Absolute Deviation (MAD) | 1047125.3 |
| Skewness | 4.8456996 |
| Sum | 1.3290656 × 10¹⁰ |
| Variance | 5.0869764 × 10¹³ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 7226362.5 | 1 | < 0.1% |
| 8387933 | 1 | < 0.1% |
| 6794584 | 1 | < 0.1% |
| 14172606 | 1 | < 0.1% |
| 12086616 | 1 | < 0.1% |
| 5758795 | 1 | < 0.1% |
| 6298131.5 | 1 | < 0.1% |
| 13723820 | 1 | < 0.1% |
| 4573777 | 1 | < 0.1% |
| 16016644 | 1 | < 0.1% |
| Other values (3162) | 3162 | 99.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 57133.19922 | 1 | < 0.1% |
| 79711.79688 | 1 | < 0.1% |
| 90558.70313 | 1 | < 0.1% |
| 97690.39844 | 1 | < 0.1% |
| 106918 | 1 | < 0.1% |
| 111969.7031 | 1 | < 0.1% |
| 113130 | 1 | < 0.1% |
| 116486.6016 | 1 | < 0.1% |
| 117438.3984 | 1 | < 0.1% |
| 123767.2031 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 98960776 | 1 | < 0.1% |
| 90609640 | 1 | < 0.1% |
| 68090728 | 1 | < 0.1% |
| 65336980 | 1 | < 0.1% |
| 65047284 | 1 | < 0.1% |
| 59107620 | 1 | < 0.1% |
| 58761304 | 1 | < 0.1% |
| 57764408 | 1 | < 0.1% |
| 56485204 | 1 | < 0.1% |
| 53166156 | 1 | < 0.1% |
Use_Steam
Categorical
Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 49.6 KiB |
[Bar chart: category frequencies for 0 and 1 (116)]
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3056 | 96.3% |
| 1 | 116 | 3.7% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3056 | 96.3% |
| 1 | 116 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3056 | 96.3% |
| 1 | 116 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3172 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3172 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3172 |
Use_Gas
Categorical
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1983 | 62.5% |
| 0 | 1189 | 37.5% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1983 | 62.5% |
| 0 | 1189 | 37.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1983 | 62.5% |
| 0 | 1189 | 37.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3172 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3172 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3172 |
Interactions
Correlations
| | ENERGYSTARScore | Latitude | Longitude | NumberofBuildings | NumberofFloors | PrimaryPropertyType | PropertyGFABuilding(s) | PropertyGFAParking | SiteEnergyUse(kBtu) | Use_Gas | Use_Steam | YearBuilt |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ENERGYSTARScore | 1.000 | 0.092 | -0.040 | 0.052 | 0.157 | 0.118 | 0.081 | 0.017 | -0.185 | 0.109 | 0.000 | 0.082 |
| Latitude | 0.092 | 1.000 | -0.014 | 0.058 | 0.060 | 0.216 | -0.056 | 0.019 | -0.088 | 0.158 | 0.279 | 0.150 |
| Longitude | -0.040 | -0.014 | 1.000 | 0.026 | -0.106 | 0.149 | -0.024 | -0.050 | 0.017 | 0.087 | 0.178 | -0.050 |
| NumberofBuildings | 0.052 | 0.058 | 0.026 | 1.000 | -0.027 | 0.132 | 0.055 | 0.006 | 0.042 | 0.029 | 0.000 | 0.038 |
| NumberofFloors | 0.157 | 0.060 | -0.106 | -0.027 | 1.000 | 0.349 | 0.453 | 0.250 | 0.286 | 0.038 | 0.269 | 0.301 |
| PrimaryPropertyType | 0.118 | 0.216 | 0.149 | 0.132 | 0.349 | 1.000 | 0.180 | 0.157 | 0.283 | 0.345 | 0.269 | 0.189 |
| PropertyGFABuilding(s) | 0.081 | -0.056 | -0.024 | 0.055 | 0.453 | 0.180 | 1.000 | 0.227 | 0.741 | 0.116 | 0.194 | 0.287 |
| PropertyGFAParking | 0.017 | 0.019 | -0.050 | 0.006 | 0.250 | 0.157 | 0.227 | 1.000 | 0.308 | 0.000 | 0.000 | 0.240 |
| SiteEnergyUse(kBtu) | -0.185 | -0.088 | 0.017 | 0.042 | 0.286 | 0.283 | 0.741 | 0.308 | 1.000 | 0.122 | 0.229 | 0.160 |
| Use_Gas | 0.109 | 0.158 | 0.087 | 0.029 | 0.038 | 0.345 | 0.116 | 0.000 | 0.122 | 1.000 | 0.021 | 0.345 |
| Use_Steam | 0.000 | 0.279 | 0.178 | 0.000 | 0.269 | 0.269 | 0.194 | 0.000 | 0.229 | 0.021 | 1.000 | 0.168 |
| YearBuilt | 0.082 | 0.150 | -0.050 | 0.038 | 0.301 | 0.189 | 0.287 | 0.240 | 0.160 | 0.345 | 0.168 | 1.000 |
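The alert list names only one highly correlated pair, PropertyGFABuilding(s) and SiteEnergyUse(kBtu). The Python sketch below shows how such pairs can be picked out of a correlation matrix by thresholding the absolute coefficient. The 0.5 cutoff is an assumption, since the report does not state the threshold it used, and only a handful of coefficients are copied from the table for brevity.

```python
# A few coefficients copied from the correlation table (one entry per unordered pair).
corr = {
    ("PropertyGFABuilding(s)", "SiteEnergyUse(kBtu)"): 0.741,
    ("NumberofFloors", "PropertyGFABuilding(s)"): 0.453,
    ("PrimaryPropertyType", "Use_Gas"): 0.345,
    ("ENERGYSTARScore", "SiteEnergyUse(kBtu)"): -0.185,
}

THRESHOLD = 0.5  # assumed cutoff, not stated in the report

# Keep pairs whose absolute correlation reaches the cutoff.
high = [(a, b, r) for (a, b), r in corr.items() if abs(r) >= THRESHOLD]

for a, b, r in high:
    print(f"{a} ~ {b}: {r:.3f}")
```

With this cutoff only the 0.741 pair is kept, which agrees with the two mirrored High correlation alerts in the overview.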
Missing values
Sample
First rows
| | PrimaryPropertyType | Latitude | Longitude | YearBuilt | NumberofBuildings | NumberofFloors | PropertyGFAParking | PropertyGFABuilding(s) | ENERGYSTARScore | SiteEnergyUse(kBtu) | Use_Steam | Use_Gas |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Hotel | 47.61220 | -122.33799 | 1927 | 1.0 | 12 | 0 | 88434 | 60.0 | 7226362.5 | 1 | 1 |
| 1 | Hotel | 47.61317 | -122.33393 | 1996 | 1.0 | 11 | 15064 | 88502 | 61.0 | 8387933.0 | 0 | 1 |
| 3 | Hotel | 47.61412 | -122.33664 | 1926 | 1.0 | 10 | 0 | 61320 | 56.0 | 6794584.0 | 1 | 1 |
| 4 | Hotel | 47.61375 | -122.34047 | 1980 | 1.0 | 18 | 62000 | 113580 | 75.0 | 14172606.0 | 0 | 1 |
| 5 | Other | 47.61623 | -122.33657 | 1999 | 1.0 | 2 | 37198 | 60090 | NaN | 12086616.0 | 0 | 1 |
| 6 | Hotel | 47.61390 | -122.33283 | 1926 | 1.0 | 11 | 0 | 83008 | 27.0 | 5758795.0 | 0 | 1 |
| 7 | Other | 47.61327 | -122.33136 | 1926 | 1.0 | 8 | 0 | 102761 | NaN | 6298131.5 | 1 | 1 |
| 8 | Hotel | 47.60294 | -122.33263 | 1904 | 1.0 | 15 | 0 | 163984 | 43.0 | 13723820.0 | 0 | 1 |
| 9 | Mid-Rise Multifamily | 47.60284 | -122.33184 | 1910 | 1.0 | 6 | 1496 | 62216 | 1.0 | 4573777.0 | 1 | 1 |
| 10 | Hotel | 47.60695 | -122.33414 | 1969 | 1.0 | 11 | 19279 | 133884 | 30.0 | 16016644.0 | 1 | 1 |
Last rows
| | PrimaryPropertyType | Latitude | Longitude | YearBuilt | NumberofBuildings | NumberofFloors | PropertyGFAParking | PropertyGFABuilding(s) | ENERGYSTARScore | SiteEnergyUse(kBtu) | Use_Steam | Use_Gas |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3363 | Other | 47.72126 | -122.29735 | 1949 | 1.0 | 1 | 0 | 11285 | NaN | 6.456654e+05 | 0 | 1 |
| 3364 | Other | 47.67295 | -122.39228 | 1911 | 1.0 | 1 | 0 | 16795 | NaN | 9.366165e+05 | 0 | 1 |
| 3365 | Other | 47.67734 | -122.37624 | 1972 | 1.0 | 1 | 0 | 12769 | NaN | 5.117308e+06 | 0 | 1 |
| 3367 | Other | 47.63228 | -122.31574 | 1912 | 1.0 | 1 | 0 | 23445 | NaN | 5.976246e+06 | 0 | 1 |
| 3368 | Mixed Use Property | 47.60775 | -122.30225 | 1994 | 1.0 | 1 | 0 | 20050 | NaN | 1.813404e+06 | 0 | 1 |
| 3370 | Other | 47.54067 | -122.37441 | 1982 | 1.0 | 1 | 0 | 18261 | NaN | 9.320821e+05 | 0 | 1 |
| 3372 | Other | 47.59625 | -122.32283 | 2004 | 1.0 | 1 | 0 | 16000 | NaN | 9.502762e+05 | 0 | 1 |
| 3373 | Other | 47.63644 | -122.35784 | 1974 | 1.0 | 1 | 0 | 13157 | NaN | 5.765898e+06 | 0 | 1 |
| 3374 | Mixed Use Property | 47.52832 | -122.32431 | 1989 | 1.0 | 1 | 0 | 14101 | NaN | 7.194712e+05 | 0 | 1 |
| 3375 | Mixed Use Property | 47.53939 | -122.29536 | 1938 | 1.0 | 1 | 0 | 18258 | NaN | 1.152896e+06 | 0 | 1 |